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ABSTRACT 

Interferon regulatory factors IRF-3 and IRF-7 are 
transcription factors essential in the activation of 
interferon-p (IFN-|5) gene in response to viral infec- 
tions. Although, both proteins recognize the same 
consensus IRF binding site AANNGAAA, they have 
distinct DNA binding preferences for sites in vivo. 
The X-ray structures of IRF-3 and IRF-7 DNA 
binding domains (DBDs) bound to IFN-p promoter 
elements revealed flexibility in the loops (L1-L3) 
and the residues that make contacts with the 
target sequence. To characterize the conformation- 
al changes that occur on DNA binding and how they 
differ between IRF family members, we have solved 
the X-ray structures of IRF-3 and IRF-7 DBDs in the 
absence of DNA. We found that loop L1, carrying the 
conserved histidine that interacts with the DNA 
minor groove, is disordered in apo IRF-3 but is 
ordered in apo IRF-7. This is reflected in differences 
in DNA binding affinities when the conserved histi- 
dine in loop L1 is mutated to alanine in the two 
proteins. The stability of loop L1 in IRF-7 derives 
from a unique combination of hydrophobic residues 
that pack against the protein core. Together, our data 
show that differences in flexibility of loop L1 are an 
important determinant of differential IRF-DNA 
binding. 

INTRODUCTION 

The interferon regulatory factor (IRF) family of proteins 
plays an essential role in the activation and regulation of 
immune response genes implicated in both innate and 
adaptive immunity (1-5). In addition, some members of 
this family have critical roles in the differentiation and 
development of hematopoietic cells and in the regulation 



of apoptosis (6,7). In mammals, nine IRF proteins have 
been identified so far and all share the same general archi- 
tecture, with a highly conserved N-terminal domain of 
approximately 120 residues, which is involved in DNA- 
specific binding and a variable C-terminal domain or 
IRF association domain (IAD) that mediates not only 
homo or hetero-oligomerization among IRF factors, but 
also mediates association with other transcription factors 
and co-activators, like CBP/p300 (8,9). Despite the high 
degree of structural similarity among the different IRF 
DNA binding domains (DBDs), there are important dif- 
ferences; for instance, IRF-4 contains an N-terminal ex- 
tension of 20 amino acids that inhibit DNA binding (10) 
and IRF-7 has a recognition helix a3 that is longer than 
other members of the IRF family (11). Nevertheless, 
all IRF proteins recognize a DNA element with a consen- 
sus sequence AANNGAAA. This sequence is found in a 
multitude of promoters from interferon response genes, 
usually containing two or more of these elements 
(12-14). The crystal structure of IRF-1 DBD bound to 
DNA (15), revealed that the IRF DBD consists of a 
modified version of the helix-turn-helix (HTH) motif 
that includes a four-stranded antiparallel (3-sheet and 
three large loops (L1-L3) connecting the different sec- 
ondary structural elements. Subsequent structures of 
IRF-2 (16), IRF-4 (10), IRF-3 (17) and IRF-7 (11) 
bound to DNA have confirmed that DNA recognition is 
achieved by (i) conserved residues on helix a3 (Arg, Cys, 
Asn) interacting with GAAA bases (AANNGAAA) in the 
major groove and (ii) a conserved His on loop LI 
protruding into the minor groove and making water- 
mediated contacts with two consecutive A:T steps 
upstream of the GAAA core sequence (AANNGAAA). 
These sets of interactions are further stabilized through 
a network of hydrogen bonds between the IRF proteins 
and the DNA phosphate backbone. The recent structures 
of IRF-3 DBD bound to the PRDI PRDIII 
DNA element of the IFN-P enhancer and the structure 
of IRF-3, IRF-7 and NF-kB DBDs bound to the 
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PRDII-PRDIII element have provided insights on the 
basis of cooperativity when two or more IRF DBDs 
bind to natural enhancers and promoters (11,18). From 
the structures, due to the overlapping nature of the 
individual sites, the binding of one IRF molecule affects 
the binding of the second molecule primarily through 
DNA bending rather than direct contacts with each 
other. Also, IRF-3 appears to be capable of recogniz- 
ing both consensus and non-consensus sites and IRF-7 
can better accommodate sites with G:C base pairs 
upstream of the AA element. In order to fully understand 
the structural and thermodynamic determinants of IRF 
DNA recognition, it is important to characterize the struc- 
tural changes that occur upon DNA binding. To address 
this issue, we have solved the structures of IRF-3 and 
IRF-7 DBDs in the absence of DNA. Our data shows 
that differences in the flexibility of loop LI in IRF 
proteins play an important role in DNA-specific binding. 

MATERIALS AND METHODS 

Protein purification 

The cDNA fragments encoding human IRF-3 DBD 
(residues 1-113) and mouse IRF-7 DBD (residues 1-134) 
were generated by PCR and cloned in pET-15b vectors 
(Novagene). All point mutations were introduced by the 
Quick-change site-directed mutagenesis kit (Stratagene). 
Proteins were purified as previously described (18). Briefly, 
the proteins were expressed in the Escherichia coli strain 
BL21(DE3) pLyS (Stratagene). Cells were grown in LB 
medium supplemented with ampicillin (100(ig/ml) at 
37°C to OD 600 0.5 and induced with 0.4 mM IPTG for 
3 h. Clarified lysates were applied to nickel affinity columns 
(Qiagen) obtaining protein up to 90% pure as judged by 
SDS-PAGE. The His-6 affinity tags were removed by 
thrombin digestion (Sigma). The proteins were then 
loaded onto a SP-Sepharose column (GE healthcare) 
equilibrated with 25 mM HEPES pH 7.0, 50 mM NaCl, 
1 mM DTT and eluted using a linear gradient of 0- 
1.0 M NaCl in buffer. Further purification was carried 
out using a SD75 gel filtration column (GE healthcare) 
equilibrated with a buffer containing 25 mM Tris-HCl 
pH 7.5, 300 mM NaCl and ImM TCEP. The purified 
proteins were concentrated and used for crystallization. 

Crystallization and data collection 

Selenomethionine (Semet) crystals of IRF-3 DBD were 
obtained by mixing equal volumes of protein solution 
(30mg/ml) and reservoir solution containing 6% PEG 
1000, 100 mM sodium acetate (pH = 6.0), 300 mM zinc 
chloride and 8% cadaverine. Crystals grew at 20° C 
within 2 days and belonged to space group P3 2 21 with 
unit cell dimensions of a = b = 64.90 A, c = 157.62 A. 
Multiwavelength anomalous diffraction data (MAD) 
were measured from a single frozen crystal at the 
Advance Photon Source (APS, beamline 17ID) of 
Argonne National laboratory. Data were measured at 
three wavelengths, corresponding to the inflection 
(0.9796 A) and peak (0.9791 A) of the selenium edge ab- 
sorption profile plus at remote point (0.9537 A). IRF-3 



crystals diffracted to 2.1 A resolution at APS. The data 
were truncated to 2.3 A, due to the presence of an ice ring 
in the 2.1-2.25 A resolution range. 

IRF-7 DBD apo crystals were obtained mixing two 
volumes of protein solution (46mg/ml) with one volume 
of reservoir solution containing 18% PEG 1500. The 
crystals grew over night and belonged to the trigonal space 
group P3i with unit cell dimension of a = b = 68.59 A, 
c = 68.61 A. Crystals were flash frozen in liquid nitrogen 
using mother liquor plus 20% glycerol as a cryoprotect- 
ant. Diffraction data were collected at National 
Synchrotron Light Source (NSLS, beam lines X6A and 
X29) at k = 1.0 A. The crystals diffracted to 1.3 A reso- 
lution. The data were integrated and reduced using 
HKL2000 package (19). 

Structure determination 

The structure of IRF-3 DBD was solved by molecular 
replacement (MR) using one of the IRF monomers from 
the IRF-3/PRDIII-PRDI complex (PDB id 2PI0) (18) as 
a search model. The initial MR solution was obtained 
using the program PHASER (20) and included only two 
molecules in the asymmetric unit. The inspection of 
density map calculated with this solution suggested the 
presence of another IRF-3 molecule in the asymmetric 
unit, which was found by the program MolRep (21). An 
anomalous difference Fourier map was used to locate the 
positions of selenium (and zinc) atoms. Structure refine- 
ment was carried out by rounds of energy minimization 
and B-factor refinement using the CNS package (22). 
Manual model rebuilding was performed with the program 
COOT (23) using composite omit maps generated by 
CNS. Final cycles of restrained refinement were per- 
formed using the programs REFMAC (24) and 
PHENIX (25). The final model contains three monomers 
in the asymmetric unit, named IRF-3A (amino acids 5-40 
and 49-110), IRF-3B (amino acids 5-39 and 49-110), 
IRF-3C (amino acids 5-39 and 49-110); 9 zinc ions; 3 
sodium ions; 7 chloride ions and 151 water molecules 
(Figure 1A). The model presents excellent geometry, 
with the Ramachandran analysis indicating that 94.6% 
of the residues are in the allowed regions and 5.4% are 
in the preferred regions (Table 1). 

The structure of mouse IRF-7 DBD apo form was 
solved by MR using a homology search model from the 
human IRF-7 DBD bound structure (PDB id 2061) (11), 
generated by the program CHAINSAW (26). The initial 
MR solution was obtained with the program PHASER, 
which suggested three molecules in the asymmetric unit. 
After a round of rigid-body refinement, the i? fact and R tTee 
values were 47 and 48%, respectively. A significant twin 
fraction (a = 0.36) was detected by the merohedral crystal 
twinning server (http://nihserver.mbi.ucla.edu/twinning/) 
and Xtriage program in the Phenix suite (25). 
Reflections for the free set were thus selected using the 
highest lattice symmetry [P3(bar)m] in order to avoid a 
bias in the calculation of Rf ree , due to the pseudo- 
symmetry generated by the twinning. In the final step of 
refinement, twining was included, with a significant improve- 
ment of the refinement statistics (final R {ree = 20.0%). 
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Figure 1. Crystal structures of human IRF-3 DBD and mouse IRF-7 DBD. (A) Ribbon representation of three IRF-3 DBDs in the asymmetric unit. 
IRF-3A is colored red, IRF-3B is colored blue, and IRF-3C is colored purple. Zinc atoms are colored green. Loop LI is represented in dashed lines. 
(B) Ribbon representation of IRF3-A. Chain was colored according to the secondary structure assignment in the alignment (see below). Missing 
residues in loop LI are represented in dashed lines. (C) Structural based sequence alignment of IRF DBDs. Missing residues in the apo IRF-3 DBD 
structure are highlighted in red. The secondary structure elements in IRF-3 DBD are shown on top of the alignment and colored according to B. 
Missing region in loop LI is shown in dashed lines. Unique residues in LI of IRF-7 are highlighted in green. (D) Ribbon representation of three apo 
IRF-7 DBDs in the asymmetric unit. IRF-7A is colored green, IRF-7B is colored cyan, and IRF-7C is colored purple. (E) Ribbon representation of 
IRF-7A. The chain was colored according to the secondary structure assignment in the alignment. 



Iterative rounds of restrained refinement and building 
were performed in the same manner as IRF3-DBD apo. 
The final model contains three monomers in the asymmet- 
ric unit, named IRF-7A (amino acids 9-130), IRF-7B 
(amino acids 9-125) and IRF-7C (amino acids 8-125); 
287 water molecules and 3 sodium ions. The model has 
excellent geometry with 98.0% of the residues in the most 
favored regions of the Ramachandran plot, and no 



residues in the disallowed regions (Table 1). All figures 
were made with the PyMol solftware (27). 

Fluorescence anisotropy measurements 

6-Carboxyfluorescein (6FAM) labeled (5'-6FAM-GAGA 
AGTGAAA GTGG-3Q and unlabeled complementary 
DNA oligonucleotides containing the PRDI consensus 
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Table 1. Data collection and refinement statistics 



Data collection 


hIRF3-DBD 


mIRF7-DBD 


Space group 


P3 2 21 


P3i 


Molecules in AU 


3 


3 


Cell parameters (A) 


a = b = 64.91, 


a = b = 68.59. 


Resolution (A) 


c = 157.62 


c = 68.61 


38.3-2.1 


50.0-1.3 


Total reflections 


314455 


559 270 


Unique reflections 


25 580 


85 725 


Data coverage (%) 


98.1 (99.7) a 


96.7 (94.2) b 


^sym (%) 


7.6 (35.2) a 


7.7 (41.8) b 


Completeness (%) 


97.97 


96.64 


//a 


27.6 (6.7)" 


21.3(3.1) b 


Refinement statistics 






Resolution range 


38.3-2.3 


30-1.3 


Reflections 


17 524 


85 702 


^cryst (%) 


21.2 


16.2 


-ftfree (%) 


26.4 


20.0 


Non-hydrogen atoms 


2552 


3234 


Protein atoms 


2382 


2944 


Solvent atoms 


151 


287 


Metals atoms 


19 


3 


Average B factors (A ) 


37.5 


23.5 


RMSD bonds (A) 


0.009 


0.008 


RMSD angles (°) 


1.198 


1.029 


Ramachandran quality plot 






Most favored (%) 


94.6 


98.0 


Allowed (%) 


5.4 


2.0 


Disallowed (%) 


0.0 


0.0 



"High-resolution shell (2.2-2.1 A). 
b High-resolution shell (1.35-1.30 A). 
R S ym = 100 x XXI /_ WD/EW- 

i? fact =100x T,(i\F 0 \-\F c \\)/j:(\F 0 \), where F a and F c are the 
observed and calculated structure factors, respectively (F 0 > 0a). R( ree 
was determined from ~5% of the data 



site were purchased form IDT DNA Technologies 
(Coralville, IA, USA). Oligos were resuspended in 
1 xTE buffer supplemented with 100 mM NaCl. DNA 
duplex were formed by heating to 95°C a mixture of one 
equivalent of the 6-FAM-labeled strand with one equiva- 
lent of the complementary strand and permitting the 
sample to cool down to room temperature. Each 
reaction sample (volume 150ul) consisted of 5nM of 
5'-Fluorescein-labeled DNA and increasing concentra- 
tions of IRF-3 DBD or IRF-7 DBD (0.5 nM-300 uM) in 
a buffer containing 25 mM HEPES pH 7.0, 50 mM NaCl. 
Reactions were incubated at 20° C for 15min prior to 
measurement. Fluorescence anisotropy intensity data 
were collected on a Beacon 2000 fluorescence polarization 
system (Invitrogen) by setting excitation filter at 490 nm 
and emission filter at 520 nm. The fraction of DNA bound 
(B) was calculated using the following equation: 

B= (IA] X - [A] DNA ) 

([A]final " [A] DN a) 

where [A] x equals the anisotropy measured at protein con- 
centration X, [A] DN a is the anisotropy in absence of 
protein and [A] final is the anisotropy value at saturation. 
Each data point was calculated from an average of five 
anisotropy measurements. DNA fraction bound values 
were then plotted versus domain concentration and the 



data were fitted by hyperbolic regression, using Origin 
6.1 (OriginLab), to the following equation: 

Bmax* Do 
B = - 

where B is the fraction of DNA duplex bound, D 0 is the 
total concentration of the DBD, and K d is the dissociation 
constant of the complex. 

RESULTS 

Overview 

As in previous structures of IRF DBD/DNA complexes, 
the apo forms of IRF-3 and IRF-7 DBD's retain an a/ (3 
architecture consisting of three a helices (al-a3) flanked 
by a four-stranded antiparallel P-sheet (pi-(34), a variant 
of the helix-turn-helix (HTH) motif found commonly in 
transcription factors (Figure IB and E) (28). As is char- 
acteristic for this family, three large loops (LI- L3) 
connect different parts of the secondary structure. Loop 
LI connects P2 and a2, loop L2 connects a2 and oc3, and 
loop L3 connects P3 and P4. The electron density for 
apoIRF-3 DBD is well defined with the exception of 4 
residues at the N-terminus and approximately 10 
residues from loop LI. In the case of IRF-7 DBD, only 
the eight residues at the N-terminus are not defined. The 
apo IRF-3 and IRF-7 DBD structures superimpose with 
an RMSD of 1.39 A, but if the superposition is performed 
with only the three a-helices and the P-sheet, the RMSD 
decreases to 1.1 9 A. The largest deviations between the 
two structures occur in loops LI and L2. 

IRF-3 DBD apo form 

The three IRF-3 DBD molecules in the crystallographic 
asymmetric unit are arranged with their loops (L1-L3) 
and helix a3 exposed to the solvent while the rest of the 
molecules pack against each other. Interestingly, the apo 
IRF-3 DBD crystals grow only in the presence of ZnCl 2 . 
We identified nine Zn +2 ions via their anomalous signal 
and together these ions mediate intermolecular inter- 
actions between the three symmetry-related molecules 
by coordinating a set of Asp, Glu and His residues 
(Figure 1A), as well as chloride ions. The coordination is 
typically tetrahedral with an average Zn 2+ -ligand distance 
of 2.1 A (29). A superposition of the three molecules yields 
a low RMSD value of 0.5 A 2 for 98 Coc's. The largest 
deviation occurs in the linker between helix a3 and 
strand P3 (RMSD of 1.47 A) of molecule C and molecules 
A/B. A notable feature of the structure is the absence of 
density for most of loop LI (amino acids Gly41-Asn48) in 
all three molecules of the asymmetric unit (Figure 1A). 

In the previous IRF-3-DNA complex structures, loop 
LI interacts with the DNA minor groove, with residues 
Trp38, Gly41, Leu42 and Gln44 contacting the DNA 
backbone and His40 making water-mediated contacts 
with the two contiguous A:T base pairs upstream of the 
GAAA core sequence (AANNGAAA) (11,17,18). The 
loop is somewhat flexible in the DNA complexes (average 
B-factor of ~70 A2) but, as we show here, in the absence 
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of DNA, it is largely disordered (Figures IB and 2 A). 
Also, residues that flank the loop deviate substantially in 
conformation from that observed in complex with DNA. 
For example, the Ca of Phe51 is shifted by ~ 10 A and the 
Ca of His40 is displaced by ~3.7A. Also, side chain 
density of His40 is defined only in one of the three mol- 
ecules in the asymmetric unit (Figure 2A). With the excep- 
tion of loop LI (and the flanking residues), the rest of the 
apo structure (including loops L2 and L3) superimposes 
well with the DNA complex, with an RMSD of ~ 1.06 A 
for 98 Ca's (Figure 3A). Most of the conformational 
changes are limited to the local rearrangement of side 
chains. Most notably, Lys77, Arg78, Arg81 and Arg86 
(on helix oc3) that make in contacts with the DNA in the 
complex adopt different (or multiple) conformations in 
the absence of DNA (Figure 3 A) (18). 




IRF-3 DBD apo 




IRF-7 DBD apo 

Figure 2. Structure of loop LI in IRF-3 DBD and IRF-7 DBD. Stereo 
view of the 2F o —F 0 electron density map for loop LI region of IRF-3A 
DBD (A, red) and IRF-7A DBD (B, green), contoured at la. For 
clarity LI is colored in yellow in both structures. IRF-7C DBD is 
colored cyan (B). Note that loop LI in IRF-3 DBD lacks density 
form residues 41-48, whereas in IRF-7 DBD there is good definition 
for the density. 



IRF-7 DBD apo form 

Although the IRF-7 DBD crystallizes in a different space 
group than the IRF-3 DBD there are again three mol- 
ecules in the asymmetric unit (Figure IB). The three mol- 
ecules superimpose with a maximum RMSD of ~0.5 A for 




hlRF3 Apo 
hlRF3 Bound 




His100 



Lys92 
Thr93 



mlRF-7 Apo 
hlRF-7 Bound 

Figure 3. Conformational changes in IRF-3 DBD and IRF-7 DBD 
upon DNA binding. (A) Structural comparison between apo and 
DNA-bound forms of IRF3-DBD. Apo IRF-3 DBD superimposes 
with an RMSD of 1.04 A on the DNA bound form (PDB ID = 2PI0 
chain B). Note the disordered region in loop LI, represented by a 
dashed line in apo IRF-3 DBD. Side chains of His40 and Leu42 on 
loop LI and Lys77, Arg78, Arg81 and Arg86 on Helix oc3 are repre- 
sented by sticks. DNA molecule from the complex is colored in grey, 
where the core recognition sequence (AANNGAAA) is highlighted in 
yellow. (B) Structural comparison between apo and DNA-bound 
forms of and IRF-7 DBD. Apo mouse IRF-7 DBD superimposes 
with an RMSD of 1.41 A on the DNA-bound form of human IRF-7 
DBD (PDB ID 2061). Side chains of His44 and Phe45 on loop LI 
and Lys92, Thr93 and HislOO on helix ot3 of mouse IRF-7 (and 
counterparts on human IRF-7) are represented by stick. Note that 
helix a.2 is extended and all three loops (L1-L3) are ordered in apo 
IRF-7 DBD. 
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117 Ca's. In contrast to the apo IRF-3 DBD, loop LI is 
well defined (Figure IE and 2B). This ordering of loop LI 
in IRF-7 does not appear to be a crystallization artifact, 
as the conformation of the loop is almost identical in 
all three molecules in the asymmetric unit despite their 
different crystal packing environments. Loop LI in 
IRF-7 differs from that in other IRF family members in 
containing a phenylalanine (Phe45 instead of an alanine) 
after the conserved histidine (His44) (Figure 1C). Phe45 
packs against Leu50 (also unique to IRF-7) in loop LI 
and Phe58 in helix oc2, as part of the hydrophobic core 
of the IRF-7 DBD (Figure 2B). Together, these hydropho- 
bic interactions appear to confer a more restrictive con- 
formation to loop LI, which is reflected in lower average 
B-factors for this region (~25 A 2 ) compared with the same 
region in IRF-3. 

Loop L2 is much longer in the IRF-7 DBD than in 
other IRFs and has been proposed to interact with the 
p50 subunit of NF-kB when the two transcription factors 
bind to adjacent sites on the interferon-P enhancer (10). 
Because of its length and the abundance of proline 
and glycine residues, it is the most flexible segment of 
the IRF-7 DBD structure. Indeed, in the structure of 
IRF-7 bound to DNA, loop L2 is partially unstructured 
(11). In the apo structure, loop L2 is well defined but 
has the highest average B-factors (-29 A 2 ) (Fi gure 3B). 
Interestingly, residues 71-78 in L2 are unique to IRF-7 
and adopt an extended conformation characteristic 
of poly glycine and poly proline chains (Figure 3B). 
This extended conformation appears to be important 
in allowing Arg67 on loop L2 to contact Asn75 on 
NF-kB. 

Intriguingly, the recognition helix a3 in the apo struc- 
ture is longer at the N-terminus by three amino acids 
when compared to IRF-7 DBD in the presence of DNA 
(Figure 3B). Among these amino acids is Arg89, which 
in the DNA complex makes contacts with the DNA 
backbone but in the absence of DNA is directed toward 
the interior of the protein and makes contacts with struc- 
tural water in the protein cavity. Interestingly, Arg89 
in apo IRF-7 adopts a similar conformation as Lys92 in 
DNA complex, pointing to a switch in position between 
the two residues upon DNA binding. One feature of 
helix a3 that is conserved in the apo and DNA bound 
forms of IRF-7 is a kink in the helix at Gly90, indicating 
that this bending is not a consequence of the DNA 
binding. 

We also identify a putative metal ion between helix oc3 
and strand (53 in the apo structure that is coordinated with 
carboxylic groups of Leu99, Glyl03 and Phel05. The 
metal (possibly Na + from the crystallization mix) appears 
to compensate for the large dipole moment of long helix 
oc3. A similar metal binding site is located in the DNA 
bound structure of IRF-2 (16). 

Mutations in loop LI 

The structures of all IRF/DNA complexes have shown 
that a conserved histidine on loop LI (His40 in IRF-3 
and His44 in IRF-7) partially enters the DNA minor 
groove and makes water mediated contacts with A:T 



base pairs upstream of the GAAA core sequence 
(AANNGAAA). The observed difference in flexibility of 
loop LI in IRF-3 and IRF-7 DBDs raises the question of 
the effect of mutating residues in loop LI on DNA 
binding. To address this, we constructed two mutants: 
(i) IRF3-H40A, in which His40 in IRF-3 DBD is 
mutated to alanine, (ii) IRF7-H44A, in which His44 in 
IRF-7 DBD is also mutated to alanine. The wild-type 
(WT) and mutant proteins were tested for DNA binding 
by fluorescence anisotropy. Figure 5 shows the result 
obtained using the PRDI DNA element with sequence G 
AGAAGTGAAAGT. Our experiments show that IRF-3 
DBD wt binds with a dissociation constant of 486 nM, 
while IRF-7 DBD wt binds with a slightly lower affinity 
of 630 nM. These values are consistent with previously 
reported affinities of IRFs proteins for the PRD-I 
element (30). However, the affinity of IRF-3-H40A de- 
creases by ~3. 6-fold (K d = 1706 nM), IRF-7-H44A binds 
with almost the same affinity (A" d = 532 nM) as the WT 
protein. 



DISCUSSION 

We present here the first crystal structures of IRF-3 and 
IRF-7 DBDs in the absence of DNA. Superposition of 
apo and DNA bound IRF-3 DBDs reveals two primary 
conformational changes on DNA binding: (i) the 
N-terminus of IRF-3 undergoes a disorder to order tran- 
sition, with Lys5 making contacts with DNA along 
the minor groove, (ii) loop LI becomes ordered and ap- 
proximately 10 residues that are not visible in the apo 
IRF3 structure are well defined in the DNA-bound struc- 
ture. In contrast to IRF-3, the LI loop is ordered in the 
apo IRF-7 DBD structure and the major change is a re- 
arrangement of the loop as a rigid body around the minor 
groove. The inherent flexibility of loop LI is consistent 
with the apo NMR structure of IRF-2 DBD, wherein 
LI displays a large number of conformations (31). A 
conserved PWKH motif in loop LI appears to be import- 
ant in positioning LI for its interaction with DNA. That 
is, the proline (Pro37 in IRF-3) fixes the adjoining trypto- 
phan (Trp38) in an outward conformation for interaction 
with the DNA backbone. The indole ring of this trypto- 
phan moves by about 1.6 A in order to interact with a 
DNA phosphate group. At the same time, the main chain 
carbonyl group of Lys39 is positioned to make a hydrogen 
bond with Lys77 (on the recognition helix a3), providing 
stability to the overall loop conformation. Notably, the 
imidazole ring of His40 swings by almost 6 A into the 
DNA minor groove in order to make water-mediated 
contacts with A:T base pairs, while the main chain 
carboxyl group of His40 interacts with the main chain 
amide group of Arg43 to further stabilize the loop 
(Figure 4A). 

In contrast to IRF-3, the LI loop is ordered in the apo 
IRF-7 DBD structure and stabilized in part by two hydro- 
phobic residues (Phe45 and Leu50) that fold back into the 
core of the protein. Upon DNA binding, the loop 
undergoes a rigid body transition of ~2 A that pivots 
around two points at both ends of the loop. At the 
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Figure 4. Structural comparison of Loop LI between apo and DNA bound form of IRF DBDs. (A) Close up view of Loop LI of IRF-3 DBD apo 
(magenta) and bound (green) forms. Conserved histidine residue (His40) is shown in stick representation. (B) Close up view of loop LI of IRF-7 
DBD apo (blue) and bound (orange) forms. Conserved histidines (His46 on human and His44 on mouse) are shown in stick representation. For 
clarity, DNA is not shown. Loop LI and the conserved histidine adopt similar conformations when bound to DNA, but adopt different (IRF-7) or 
disordered (IRF-3) conformations in the absence of DNA. 



N-terminal end, the Ca of the conserved residue Trp42 
moves by about 1 A in order to interact with the DNA 
backbone. Moreover, the displacement of Trp42 pushes 
against the other pivot site in residue Phe60. Similarly to 
IRF-3 DBD, the imidazole ring of the conserved Histidine 
residue (His44) rotates around the chil bond by ~25° to 
position itself to interact with the minor groove. The dif- 
ference in the inherent flexibility of loop LI between 
IRF-3 and IRF-7 seems to be reflected on the effect of 
His mutations on DNA binding. Thus, while mutation 
of His40 to alanine significantly decreases DNA binding 
of IRF-3 by ~3. 6-fold, the equivalent mutation in IRF-7 
shows no significant change in binding (Figure 5). The 
decrease in binding with the IRF-3 H40A mutation 
likely represents the loss of interactions that stabilize 
loop LI upon DNA binding, including the interactions 
between His40 and the DNA minor groove. However, 
because loop LI is already ordered in apo IRF-7 and 
His44 is involved in interactions with Serll2 and several 
water molecules, it is conceivable that there is less of a gain 
in enthalpy when His44 (compared to His40 in IRF-3) 
interacts with DNA. Our data suggest that differences in 
the inherent flexibility of loop LI between IRF-3 and 
IRF-7 have a direct effect on DNA binding and may 
play a role in the distinct DNA binding specificities 
observed between the two proteins. Indeed, there is some 
indication from biochemical studies that IRF-3 and IRF-7 
interact differently with DNA. IRF-7, for example, is 
more tolerant than IRF-3 to changes in the AA 
sequence upstream of the GAAA core (AANNGAAA) 
suggesting a somewhat looser interaction with DNA 
(9,31). To more fully explore the differences between 
IRF-3 and IRF-7 in DNA selection will require additional 
mutations and a more detailed thermodynamic analysis on 
several DNA sites. 

Taken together, we show here that the structures apo 
IRF-3 and IRF-7 DBD's are generally similar to the DNA 
bound forms, including a kink in the recognition helix a3 
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Figure 5. Effect of mutations in loop LI on DNA binding. Isothermal 
binding curves over the PRDI consensus DNA site for IRF-3 DBD 
wild type (black squares), IRF-3 DBD H40A (red dots), IRF-7 DBD 
wild type (green triangles) and IRF-7 DBD H44A (blue inverted tri- 
angles). The calculated K^s are shown in the table. 



and ordered loops L2 and L3. The primary difference 
is in loop LI, which undergoes a disorder to order transi- 
tion in the IRF-3 DBD and a conformational change 
in the IRF-7 DBD. The varying intrinsic flexibility of 
loops and tails in the IRF family may serve as a mechan- 
ism to modulate the binding specificity of its members and 
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to respond to a larger population of diverse promoter 
sites. 
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